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A FILTER AND A METHOD OF FILTERING ELECTRONIC MESSAGES 
INTRODUCTION 

The present Invention relates to a filter for filtering electronic messages e.g. for discharging 
spam mail from an electronic mailing system. In particular, the Invention relates to a filter for 
filtering electronic messages, said filter comprising: 

- storage space for an allowed list comprising Identification insignias of senders 
which have been approved for sending messages to a recipient, 

- means for receiving a first electronic message from a sender, 

* 

- means for capturing from the message an identification Insignia of the sender, 

- means for capturing from the message an identification insignia of the 
recipient, 

- first checic means for comparing the identification Insignia with the allowed list 
for determining either to withhold the message or to forward the message to 
the recipient, 

- means for storing the message and insignia, 

- means for generating a return mail to the sender in case the Identification 
insignia is not included In the allowed list, the means for returning the mall 
being adapted to Include in the returned mail a unique code, and a message 
for the sender to reply to the return mall by sending it back without changing 
the unique code, 

- means for storing the unique code and relating it to the insignia, and 

- second check means for receiving a second electronic message and for 
recognising the second message being a reply to the returned mall. 

Accordingly, the filter Is of the kind wherein a sender's address is compared with a list of 
allowed addresses, and if the address is not contained therein, a reply Is generated for the 
sender requesting completion of a registration .process. 
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BACKGROUND OF THE INVENTION 

■ 

In parallel with the increasing growth of popularity of electronic mailing systems for 
exchanging information globally, the desire of reducing the number of received messages by 
automatically filtering out unwanted messages has increased. Spam mail, I.e. mail messages 
which are forwarded to a large number of unknown recipients e.g. with the purpose of 
advertising or In general for distributing Information can slow down mail servers severely by 
occupying processor and storage resources, and, In worst case, the users may in annoyance 
over a large number of irrelevant messages overlook messages of importance. 

Filters of the above described kind exist already. In US 6,199,102 the prompt Is designed to 
require manual operation whereby an automatic reply set up by a computer system, e.g. a 
computer system adapted for forwarding messages to a large group of recipients, are filtered 
out. The required manual operation could be a required response to a question, e.g. "what Is 
the colour of the sky", or ''what is the current month". In such a system, there Is always a 
potential risk that persons of interest to the recipient get annoyed over the troubles, and e.g. 
after having given a wrong answer, e.g. by misspelling the colour of the sky, give up trying 
to reach the recipient. 



DESCRIPTION OF THE INVENTION 

It Is an object of the present Invention to enable filtering of undesired mails from desired 
malls without requiring burdensome Implication of the sending or the receiving part. 
Accordingly, the invention, in a first aspect, provides a filter of the above-mentioned kind, 
and further comprising: prioritising means which, in response to recognition of the second 
message being a reply to the returned mall, assigns a priority to each of the identification 
insignias of the senders of the first messages. The filter furirher comprising means for 
selecting Identification insignias and adding the selected insignias to the allowed list, wherein 
the means for selecting are adapted to carry out the selection according to the priorities 
assigned to the Identification insignias. 

Since the user is not prompted for any specific answer but only is requested to push a 
"Reply" button, merely nothing should prevent a user from continuing an attempt to send a 
desired electronic message to a recipient. Moreover, those who, in spite of the requirement 
for replying to the message, attempts to send spam mail will not get directly through to the 
recipient. Instead, Identification insignias of all replying senders are prioritised for furtiher 
selection of insignias which will enter the allowed list. As a consequence of the prioritising, it 
is possible to do the final selection, e.g. manually without exercising an excessive wori< load. 
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The filter thus combines two separate filtering processes, and the combination provides a 
reduced number of received spam maiis, and a reduced number of senders giving up an 
attempt to communicate a deslrabie message. 

Often, electronic mail systems, worl< with multiple addresses (IP addresses). On some 
5 addresses they can send out messages and on other addresses they can receive messages. 
The addresses are listed in the so called dns entries. It may therefore be an advantage to 
provide in the filter, means for capturing from the message a list with multiple addresses 
from which the sender has access to send messages. If sender Q uses a mall system with 5 
addresses, the filter should be able to obtain from this sender a list containing ail 5 
10 addresses. During the selection of insignlas to be added to the allowed list, ail 5 insignlas 
should be added in one process step. In that way, it can be prevented that sender Q caused 
. by an unsuspected use of another of the 5 addresses In a later attempt has to go through the 
complete acceptance procedure again. 

Sometimes, electronic messages are communicated between large groups of people, I.e. one 
15 specific mail could be send to a plurality of recipients, and each recipient can choose a ''Reply 
to All" command. In this case, acceptance of one person amongst a group of persons may • 
advantageously Imply an automatic acceptance of other people in that group. For that 
purpose, the filter may comprise means for generating a predict allowed list comprising 
Identification insignlas of third party mail recipients Included by the sender in the first or the 
20 second electronic message. This list can be used in rule based selection methods by the 
prioritising means. 

Normally, the prioritizing of the identification insignlas eases the adding of Insignlas to the 
allowed list for the person in charge of this task. However, In one preferred embodiment, the 
filter may comprise a rule based selection method which, based on recognition of specific 

25 patterns in the prioritised Insignlas or mail content can select specific Insignlas directly for the 
allowed list without user Intervention. As an example, such ruled based method could be 
adapted to enter ail insignlas except insignlas containing a specific domain name, e.g. 
"hotmali" or similar ''free of charge" domains. In another example, the rule based method 
could further be adapted to analyse the content of the message, e.g. for Identifying a specific 

30 frequency of occurrence of one or more predetermined Iceywords in an electronic message. . 
Accordingly, the following can exemplify the situation: 

Person Q sending from the domain DOTCOM PANYNAME is not on the allow list but the 
message content contains a word recognised by the rule and therefore the insignia Is added 
to the allow list and the mail is delivered to the recipient. As an example, the message may 
35 contain a word which indicates that the sender knows the recipient very well, e.g. the 
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message may contain names or expression only being used within a narrow group of people. 
In another example, a message contains a word which removes the associated insignia from 
an allowed list, e.g. words which indicates that it could be a spam mail. 

In a second aspect, the Invention provides a method for nitering messages and in accordance 
with the features described for the first aspect of the invention. 

■ 

DETAILED DESCRIPTION OF THE INVENTION 

In the following, the Invention will be described in further details with reference to the 
drawing in which: 

Figs. 1 and 2 show a diagram of a software implemented filter for filtering spam messages 
from desired messages, and 

Fig. 3 shows a screen dump from a computer implementation of the invention. 

The following description is based on the Implementation of the invention in a computer 
system for filtering e-mail messages. The implementation is done as a SMTP gateway but it 
could also have been Implemented as a part of a messaging system (e.g. I^icrosoft Exchange 
server, Lotus Notes or Novell GroupWise). Reference is further made to the '^Simple mail 
transfer protocol", Jonathan B. Postel, RFC 821 of August 1982 from Information Sciences 
Institute, University of Southern California, Marina del Rey, California and STANDARD FOR 
THE FORMAT OF ARPA INTERNET TEXT MESSAGES August 13, 1982 Revised by David H. 
Crocl<er Dept. of Electrical Engineering University of Delaware, Newark, DE 19711 Network: 
DCrocker @ UDel- Relay. 

Incoming Emails: 

An email Is received from the sender via the SMTP daemon on TCP port 25 (RFC 821) on the 
receiving server 1. The receiving server then extracts the sender Insignia from the Mail From 
on the SMTP protocol and the IP address of the sending server from the IP protocol 1. The 
receiving server then checks the insignia against the black list 2 If the Insignia Is Included in 
• the black list, the email Is rejects and the sender's server is Informed about this via an error 
message on the SMTP protocol 3, 

If the insignia is not included in the black list the receiving server checks the subject line 
from the mime message (RFC 822, I.e. STAI\IDARD FOR THE FORMAT OF ARPA INTERNET TEXT 



wo 2005/096584 



5 



PCT/DK2005/000219 



MESSAGES August 13, 1982, Revised by David H. Crocker, www.faqs.org/rfcs/rfc822.html) for a unique 
code and compares this code and the insignia with the list of previously generated l<ey pairs 
4. If the Incoming email does not contain a valid key pair the receiving server then checks If 
the Insignia Is on the allow list 13. If the Insignia is not on the allow list the receiving server 
will send back a reply request to the sender by doing the steps described In the following 
text. The receiving server generates an error message on the SMTP protocol that informs the 
sending server that the email could not be received because the sender is not registered as a 
valid sender of mail to the receiver 14. After this step, the SMTP communication is either 
closed by the sending server or a new email Is delivered to the receiving server. The 
receiving server then prioritizes the Incoming email according to a given rule set and assigns 
a priority value to the email message 15. If the assigned value is not exceeding a given 
threshold value a Unique ID is generated by the receiving system 18. The incoming email, 
insignia and Unique ID is then stored by the receiving server for later retrieval 19. The 
receiving server sends an email to the sender of the incoming email asking to reply to It 
without changing the subject line of the mime message (RFC 822) 20. 

If the Incoming email did get a priority value exceeding the given threshold value 16 the 
insignia of the sender Is added to the allow list 17. 

If the Incoming email sender insignia was on the allow list 13 then the receiving server 
captures the CC: from the mime message and stores them the predict allow list for later 
retrieval 11. The email Is then delivered to the recipient mall box or forwarded to the next 
step 12. 

If the email did contain a valid key pair 4 the receiving server will prioritize the email 
message and assign It a priority value 5. If the assigned value Is not exceeding a given 
threshold the choice of allowing the email is up to the administrator 7. If the email Is not 
approved by the administrator the original message, unique ID and Insignia Is deleted from 
storage space 8. If the email is approved by the administrator or had a priority value 
exceeding the threshold 6 the insignia Is added to the allow list 9. The receiving system then 
informs the sender by sending an email that he can now send emails to the recipient and that 
the previously sent email will be delivered 21.The receiving system captures the CC: from 
the mime message and stores them In the predict allow list for later retrieval 11. 

The email Is then delivered to the recipient mail box or forwarded to the next hop 12. Then 
the receiving system deletes the Email, Insignia and unique ID from storage 10. 
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Outgoing Emails; 

An email Is received from the sender via the SMTP daemon on TCP port 25 (RFC 821) on the 
server 23 from an Internal sender. The server adds the insignia of the recipient/recipients 
captured from the SMTP rcpt to: (RFC 821) on the Allow list 24. The email Is delivered to the 
5 recipients SMTP via TCP port 25 (RFC 821) 25. 

Description of technical terms In one specific implementation of the invention: 

The simplest form of an insignia is an email address or the domain part of the sender's email 
address. More complex inslgnlas can include an email address with or without a list of IP 
addresses from which a user can send. Alternatively, the Insignia may include a domain with 
10 or without a list for IP addresses from which the user from that domain can send. 

The domain part could be defined as the sub string starting after the sign and ending at 
the end of the email address Including the last character. 

The storage space could be ciefined as space on a harddlsk or space on a set of harddisks 
(Raid set) In which the inslgnlas are stored as records- on the insignia list (allow list and 
15 predict allow list). An allow list can In the simplest form consists of only sender Inslgnlas or 
more complex to allow fme grained control by making a list that includes a relationship 
between a sender Insignia and a recipient insignia. 

The system is connected to the Internet and running a SMTP daemon on TCP port 25 from 
which it can receive Incoming messages from senders connected to the same net. 

20 Using the SMTP protocol specified In RFC 821 the system captures the sender's email 

address. I.e. an identification insignia from the mail from: command. Further the system 
captures the sender's IP address from the TCP layer. By use of the SMTP protocol specified in 
RFC 821 the system captures the sender's email address Insignia from the rcpt to: command. 

The system compares the sender insignia with the Inslgnlas stored In the allow list using the 
25 same type of insignia record (simple or complex) as defined by the allow list record for a 
specific record. 

The Message and the Insignia could be stored in a space on a harddlsk or in a space on a set 
of harddisks (Raid set) in which the message and insignia is stored for later retrieval. 
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If the identification insignia is not Included in tlie allowed list, a retunn mall to the sender is 
generated In program memory and send to the sender via the STMP daemon using TCP PORT 
25.Thls return message subject field (RFC 822) Includes a 128 bit value code converted to a 
string plus a word that can be scanned for In the Incoming email messages (e.g. ID= 
5 C4389FBD-0872-4077-8FFF-0001901F1D89). The code Is generated by making a random 
128 bit number and then converting it to a string. Subsequently, the unique code and a 
relation between the Insignia are stored as a key pair in a space on a harddisk or space on a 
set of harddisks (Raid set). The Insignia is stored with a pointer to the code for later retrieval. 

Capturing the unique ID can be done by scanning the Incoming email messages subject field 
10 (RFC 822) for a word (e.g. ID=) and then capturing the substring containing the next 19 
characters after the ID=. 

Checking the validity for an Incoming unique id and insignia can be done by matching up the 
key pair represented by these two values and the key pair list generated by the first 
message. 

15 Prioritizing the message can be done by having static or dynamic lists of words, email 

addresses (predicts allow list) and/or domains which is assigned a weight value stored in a 
space on a harddisk or in a space on a set of harddisks (Raid set). The entities in the list are 
then compared to the content of the message and insignias word by word or Insignia by 
insignia. A value equal to the sum of the weights values for ail matches is attached to the 

20 message as a priority value. 

The choice of allowing the message could be up to the administrator. For this purpose a GUI 
is designed to display the needed information for the administrator to make the choice in a 
list sorted by the priority value -the higher the value the more Important the message is. The 
GUI in question is shown in Fig. 3. 

25 An internal sender can be detected by using the sender's IP address and matching it to a list 
consisting of predetermine address IP for internal sender. The list could be stored as records 
in a database on a space on a harddisk or in a space on a set of harddisks (Raid set). 



